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In this paper we give a new way to quantify the folklore notion that quantum measurements bring 
a disturbance to the system being measured. We consider two observers who initially assign iden- 
tical mixed-state density operators to a two-state quantum system. The question we address is to 
what extent one observer can, by measurement, increase the purity of his density operator without 
affecting the purity of the other observer's. If there were no restrictions on the first observer's 
measurements, then he could carry this out trivially by measuring the initial density operator's 
eigenbasis. If, however, the allowed measurements are those of finite strength — i.e., those measure- 
ments strictly within the interior of the convex set of all measurements — then the issue becomes 
significantly more complex. We find that for a large class of such measurements the first observer's 
purity increases the most precisely when there is some loss of purity for the second observer. More 
generally the tradeoff between the two purities, when it exists, forms a monotonic relation. This 
tradeoff has potential application to quantum state control and feedback. 
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I. INTRODUCTION 

Since the earliest days of quantum mechanics, a com- 
mon idea associated vifith the measurement process has 
been that it necessarily disturbs or interferes with the 
system being observed. For instance Bohr, in his reply 
to the Einstein-Podolsky-Rosen paper [1| , writes that the 
quantum description "may be characterized as a rational 
utilization of all possibilities of unambiguous interpreta- 
tion . . . compatible with the finite and uncontrollable 
interaction between the objects and the measuring in- 
struments" d^. Or Pauh, on a much later occasion, 
writes again, "every act of observation is an interference, 
of undeterminable extent, with the instruments of obser- 
vation as well as with the system observed, and interrupts 
the causal connection between the phenomena preceding 
and succeeding it" |^. See Refs. 0,g] for a more complete 
bibliographic account of this issue. 

Without question, it has also been apparent since the 
earliest days of the theory that these proclamations are 
somewhat dubious. The question is this: What is it that 
is being interfered with or disturbed in a measurement? 
If there were a set of hidden variables underneath the sta- 
tistical predictions of quantum theory, then the answer 
would be at hand: The act of measurement disturbs the 
hidden variables. In the absence of a hidden- variable ex- 
planation Q, however, this becomes a moot point. Mea- 
suring X cannot disturb p ii p does not have an indepen- 
dent existence before a measurement elicits its value [|0| . 
In fact one has to wonder why the word "measurement" 
is used at all in this context: If there are no free-standing 
values X and p to disturb, then surely there are no values 
to measure either. 

Eschewing metaphysical concerns, one might try to 
give a precise sense to the idea that measurements cause 



disturbance by focusing solely on the wavefunction itself. 
For, the wavefunction appears to be the simplest term 
in the theory that would even allow a precise formula- 
tion of the question. One might say for instance, "The 
word measurement is a misnomer for our experimental 
interventions into the course of nature [|ll],|2| . The un- 
predictable wavefunction collapse is the quantitative sig- 
nature of a disturbance in quantum measurement. Since 
the state change is random, the measurement causes an 
uncontrollable disturbance." But this formulation too is 
not without problem. The quantum state resulting from 
a measurement depends in a crucial way on the precise 
form of the measurement interaction [ |l3| . In particular, 
if there is only a single quantum state under scrutiny — as 
was the case in the original Heisenberg uncertainty rela- 
tion discussion [|l4| — or even an unknown state drawn 
from a fixed orthogonal set pq ] , then a measurement in- 
teraction can always be rigged for any observable so that, 
upon completion of the process, the quantum state is re- 
turned to its initial value [Q. It does not matter that 
the measurement outcome is random and unpredictable: 
If the discussion is limited to a single quantum state or 
an orthogonal set, then there need be no disturbance in 
the sense of a necessary wavefunction change. 

What appears to be needed is a situation where more 
than one quantum state from within a nonorthogonal set 
arises naturally into the considerations. Indeed, perhaps 
the first phenomenon to give a precise meaning to the 
idea that information-gathering measurements necessar- 
ily cause an accompanying disturbance is quantum cryp- 
tography [[l7|jlq ]. There it is essential that the systems 
are known to be prepared in one or another quantum 
state drawn from some fixed nonorthogonal set [|l9|-pi|. 
These nonorthogonal states are used to encode a poten- 
tially secret cryptographic key to be shared between the 



sender and receiver. In this case, the information an 
eavesdropper seeks is not about some nonexistent hid- 
den variable hke x or p, but instead about which quan- 
tum state was actually prepared in each individual trans- 
mission. What is novel here is that the encoding of 
the proposed key into nonorthogonal states forces the 
information-gathering process to induce a disturbance to 
the overall set of states. That is, the presence of an ac- 
tive eavesdropper transforms the initial pure states into 
a set of mixed states or, at the very least, into a set of 
pure states with larger overlaps than before. This action 
ultimately boils down to a loss of predictability for the 
sender over the outcomes of the receiver's measurements 
and, so, is directly detectable by the receiver revealing 
some of those outcomes for the sender's inspection. In 
fact, there is a direct connection between the statistical 
information gained by an eavesdropper and the conse- 
quent disturbance she must induce to the quantum states 
in the process. As the information gathered goes up, the 
necessary disturbance also goes up in a precisely formal- 
izable way |2|-|2|] . 

Note the two ingredients that appear in this formula- 
tion. First, the information gathering or measurement is 
grounded with respect to one observer (in this case, the 
eavesdropper), while the disturbance is grounded with 
respect to another (here, the sender). In particular, 
the disturbance is a disturbance to the sender's previous 
information — this is measured by his diminished ability 
to predict the outcomes of certain measurements the le- 
gitimate receiver might perform. No hint of any variable 
intrinsic to the system is made use of in this formulation. 
In itself, this is already a rupture from the founding fa- 
thers' description of disturbance in measurement. As far 
as we can tell, all early literature on the subject refers the 
discussion of disturbance exclusively to the system and 
the invasive measuring device, not to the perspective of 
various observers ^. 

The second ingredient is another break with the found- 
ing fathers. One must consider at least two possible non- 
orthogonal preparations in order for the formulation to 
have any meaning. This is because the information gath- 
ering is not about some classically-defined observable — 
i.e., about some unknown hidden variable or reality in- 
trinsic to the system — but is instead about which of the 
unknown states the sender actually prepared. The les- 
son is this: Forget about the unknown preparation, and 
the random outcome of the quantum measurement is in- 
formation about nothing. It is simply "quantum noise" 
with no connection to any preexisting variable. 

How crucial is this second ingredient, i.e., that there 
be at least two nonorthogonal states within the set under 
consideration? We can start to readdress its necessity by 
making a slight shift in the account above. Divorcing the 
discussion from a cryptographic protocol, one might say 
that the eavesdropper's goal is not so much to uncover the 
identity of the unknown quantum state, but to sharpen 
her predictability over the receiver's measurement out- 
comes. In fact, she would like to do this at the same time 



as disturbing the sender's predictions as little as possible. 
Changing the language still further to the terminology of 
Ref. P4 I, the eavesdropper's actions serve to sharpen her 
information about the potential consequences of the re- 
ceiver's further interventions upon the system. (Again, 
she would like to do this while minimally diminishing 
the sender's previous information about those same con- 
sequences.) In the cryptographic context, a byproduct 
of this effort is that the eavesdropper ultimately comes 
to a more sound prediction of the secret key. From the 
present point of view, however, the importance of this 
change of language is that it leads to an almost Bayesian 
perspective on the information-disturbance problem [ p5[ . 

Within Bayesian probability theory, one of the overar- 
ching themes is to identify the conditions under which a 
set of decision-making agents can come to a common be- 
lief or probability assignment for some specified random 
variable even though the agents' initial beliefs may differ 
|26| . One might similarly view the process of quantum 
eavesdropping. The sender and the eavesdropper start off 
initially with differing quantum state assignments for a 
single physical system. In this case it so happens that the 
sender can make sharper predictions than the eavesdrop- 
per about the outcomes of the receiver's measurements. 
The eavesdropper, not satisfied with the situation, per- 
forms a measurement on the system in an attempt to 
sharpen those predictions. In particular, there is an at- 
tempt to come into something of an agreement with the 
sender but without revealing the outcomes of her mea- 
surements or, indeed, her very presence. 

It is at this point that a distinct property of the quan- 
tum world makes itself known. The eavesdropper's at- 
tempt to surreptitiously come into alignment with the 
sender's predictability is always shunted away from its 
goal. This shunting of various observer's predictability 
(and perhaps only this shunting |g^) is the subtle man- 
ner in which the quantum world is sensitive to our ex- 
perimental interventions. 

This motivates finally the following problem, which is 
the subject of our paper. Suppose two players — let us 
call them Alice and Bob from here out — come to agree 
about the way a quantum system will react to any mea- 
surement. In other words, by Gleason's theorem P8[ , 
suppose they start with an identical density operator as- 
signment p for the system. The case we are interested 
in most is when p is a mixed-state. Under what condi- 
tions can one player — Alice, say — surreptitiously increase 
her knowledge of the system without forcing the other 
player's knowledge to become less relevant? (See Fig.nl) 

To move toward making this question precise, imag- 
ine that a third player will perform some measurement 
on the system in the future, but neither Alice nor Bob 
know which it will be. Depending upon which measure- 
ment is ultimately performed, Alice and Bob will have 
varying degrees of predictability for its outcomes. For 
instance, consider how their predictability fares with re- 
spect to various simple von Neumann measurements. If 
the measurement happens to be the eigenbasis of p, the 



Shannon entropy of the outcomes — which is a good mea- 
sure of predictabihty [p9[ — will be the minimal value it 
can be |3Q]. This turns out to be the von Neumann en- 
tropy S{p) — —trplogp. On the other hand, if the mea- 
surement happens to be a "mutually unbiased" basis | |3l| ] 
to the eigenbasis, then all measurement outcomes will be 
equally probable, and the outcome entropy will be logd, 
where d is the dimension of the system's Hilbert space. 




Unfortunately, the answer is trivial if we leave the ques- 
tion posed in such a simplistic way. For, we need only 
suppose that Alice measures an eigenbasis Ilf, = \b){b\ of 
p to derail the whole program. Upon finding some result 
b, Alice will collapse her description of the system from 
the mixed state p to the pure state |p4] 



Pb = —UbpUb , 

Pb 



(1) 



where pb = tr(pnf,) is the probability of the particular 
outcome. The upshot of this is to make Alice's final pu- 
rity for the system maximal, while as far as Bob is con- 
cerned the system's density operator will not be affected 
at all. This is because, with respect to Bob's state of 
knowledge, the quantum state evolves simply to a mix- 
ture of Alice's states, i.e., 



^PbPb = P , 



(2) 



FIG. 1. Here two observers both ascribe a density matrix 
p to a quantum system. The observer inside the box (Alice) 
makes a measurement on the system without telling the result 
to the observer outside (Bob). Alice wishes to obtain as much 
knowledge about the final state of the system as she can, while 
causing as little disturbance to Bob's state-of-knowledge as 
possible. 

For the purpose at hand, we would like to capture in a 
single number something about how much Alice and Bob 
can predict of the unknown measurement. As a simple 
example, we might average the Shannon entropy of the 
measurement outcomes over the unique unitarily invari- 
ant measure (or "uniform" measure) on the space of von 
Neumann measurements p2|,p3i. This would represent 
how well Alice and Bob will fare on average with respect 
to a completely random von Neumann measurement. Or, 
we might simply consider the entropy of the best case 
scenario, i.e., the von Neumann entropy of p as above. 
Without getting specific — all will be made precise later — 
we will generically call measures of this flavor, measures 
of purity. The main intuition we want to capture is that 
when p = IV') (V' I is a pure state, then Alice and Bob 
should generally have the most predictability over the 
third party's measurements. When p is the "completely 
mixed state" — i.e., proportional to the identity operator, 
p = ij — they should have the least. 

The precise question we want to address is, can Alice 
secretly increase the purity of her quantum state assign- 
ment at the same time as leaving the other player's pu- 
rity unscathed? If she cannot, then such a failure may 
hint at another interesting way to quantify a quantum 
information-disturbance tradeoff. The hallmark of this 
formulation would be that it works even in the case where 
there is only a single initial quantum state (albeit a mixed 
state), while still capturing the shift in language we used 
to reformulate the quantum eavesdropping process. 



and so his purity and indeed his quantum state assign- 
ment remain the same. 

The key to finding something interesting here is to ask 
what would happen in the case where there is a well- 
justified restriction to the class of measurements Alice 
can perform. For instance, suppose Alice has not yet 
reached the technologically advanced stage of being able 
to perform a truly perfect von Neumann measurement. 
Maybe she is using a finite temperature Stern-Gerlach de- 
vice to perform a spin measurement on an electron and, 
because of thermal noise, it every now and then regis- 
ters a spin to be down when it should have registered 
a spin-up. To put this another way, instead of project- 
ing p into the states Ho and Hi as Alice would like, the 
Stern-Gerlach device projects p according to a more gen- 
eral Liider's rule for positive operator valued measures 

(povMs) llllel. 



Pb 



1 

Pb 



-^b P^b 



where in this case 



Eq = kHo + (1 - «.)ni 

£^1 == (1 - K)no + kHi , 



(3) 



(4) 
(5) 



and Pb = iv{pEb). Similarly, the description of the state 
change from Bob's perspective must be in accord with 
this, and so is 



P = ^PbPb 



(6) 



When K is a number strictly between and 1, we will 
call this an instance of a finite- strength quantum mea- 
surement. (We use this suggestive terminology because 
we imagine that Alice can never really get to a perfect 
von Neumann measurement without the expenditure of 



an infinite amount of effort.) Wfiat can be said in a case 
like this? 

Well, again, Alice will be able to generally increase the 
purity of her state without causing any decrease to Bob's 
purity. She does this, as before, simply by choosing Hq 
and Hi to be eigenprojectors of p. Then Eq and Ei com- 
mute with the initial density operator, and it is straight- 
forward to check that p = p. However, we can now ask 
whether this is the strategy that brings the greatest ben- 
efit to Alice. Might it be the case that Alice can increase 
her purity even more on average if she chooses Hq and Hi 
to be noncommuting with p? Moreover if it does, what 
kind of havoc will that wreak on Bob's description of the 
system? What we are imagining here in the imagery of 
the Stern-Gerlach device is that though Alice may not be 
able to chill her magnets to absolute zero, she can at least 
adjust their spatial orientation at will. Is this a freedom 
she should make use of? 

Interestingly, it turns out that there is a tradeoff in the 
two final purities. Whenever p is nonpure (so that there 
is actually something to be "learned") and < k < 1 
(so that the measurement is of finite strength), Alice's fi- 
nal purity will be the greatest on average precisely when 
Bob's purity has decreased the most in turn. Moreover, 
varying through the class of measurements that lead from 
the least average final purity to the most (with respect 
to Alice), we find that Bob's purity goes down mono- 
tonically as Alice's goes up. As we will show, this is an 
example of a more general phenomenon where the mea- 
surement operators are not so restricted as in Eqs. (jj) 
and (ra): for a large class of finite-strength quantum mea- 
surements, a nontrivial tradeoff relation always exists. 

The plan of the remainder of the paper is as follows. In 
Section II, we give a precise formulation of the problem in 
the widest setting, including definitions of various mea- 
sures of purity and also a definition of the general notion 
of a finite strength quantum measurement (without feed- 
back). In Section III, we work out an analytic form for 
the changes of purity for both Alice and Bob under the 
assumption of a particularly simple measure of purity and 
the restriction that Alice's quantum measurements have 
only two outcomes. We then explore the various regimes 
of the convex set of measurements and exhibit the general 
information tradeoff relation where it exists. We close in 
Section IV with a few concluding remarks about the sig- 
nificance of this result. In Appendix A, we prove that 
any efficient measurement (POVM) will increase Alice's 
purity on average (for any measure of purity that is a 
convex function of the density operator's eigenvalues) — 
this result is essentially identical to one proven recently 
in Ref. |^. In Appendix B, for comparison with the 
main result here, we consider a variation of the problem 
where we vary over all measurements of a given finite 
strength instead of only those on the unitary orbits of a 
given fiducial measurement. 



II. FORMULATION 

Our problem concerns two agents, Alice and Bob, who 
initially ascribe a single density operator p to a quan- 
tum system in which they have some interest. The most 
important case for us is when p is a mixed state, i.e., 
trp^ < 1. For generality in the formulation, let us assume 
that p is a density operator over a d-dimensional Hilbert 
space Tid- The detailed considerations begin when Alice 
tries to surreptitiously increase her "knowledge" of the 
system — that is, to obtain a new density operator that 
is closer to being a pure state than her initial ascription. 
The only way she can do this is by performing a quan- 
tum measurement behind Bob's back. To be as generous 
as we can be without trivializing the problem, let us as- 
sume that Bob knows everything of Alice's plan, even 
her precise measurement interaction. The only informa- 
tion barred from Bob is the precise outcome of Alice's 
measurement. 

The formalism for treating the most general kind of 
quantum measurement is that of the positive operator 
valued measure, or POVM |Q. In this formalism a mea- 
surement corresponds to a sequence of operators on Tid — 
denoted by £ = {Ei,)^^ — with a finite number of nonva- 
nishing Eh, such that each element of the sequence is a 
positive semi-definite operator, i.e.. 



{ib\Eb\^)>0, V|^) 



(7) 



and, together, the elements form a resolution of the iden- 
tity. 



b 



E. 



(8) 



The outcomes of the measurement are specified by the 
index b and occur with probabilities pb = trpEi,. 

Upon finding an outcome 5, the laws of quantum me- 
chanics specify that Alice's state can evolve into any 
other density operator of the form |16| : 



P 



Pb 



1 
Pb 



Z^ 



Ab^pAl , 



where 



Y^AlAu = Eb 



(9) 



(10) 



Since Bob knows nothing of Alice's outcome, as far as he 
is concerned the state of the quantum system will evolve 
according to 



P 



P = ^PbPb 

b 



Y.'^b^pAl 



(11) 

(12) 



b.i 



Note that the decomposition of each Et into the oper- 
ators Abi in Eqs. (O) and (lin) depends crucially upon the 



interaction Alice chooses for carrying out the measure- 
ment £. Whenever the range of the index i is restricted 
to a single value, we say that Alice's measurement is an 
efficient one [|39| . 

Efficient quantum measurements (with respect to a 
given £) correspond to holding on to as much informa- 
tion as possible in the measurement process. That is to 
say, such measurements do not break quantum coherence 
more than is necessary for the given POVM. In the sub- 
sequent development we will only consider efficient mea- 
surements for just this reason. Hence, in the language of 
equations, we will only consider conditional state changes 
of the form 



where AlA 



b^b 



P 



Eb. 



Pb 



= —AbpAl , 
Pb 



(13) 



By the polar decomposition theorem for operators [^ 
we can always write 



Ab = UbE^ 



1/2 



(14) 



where Ub is a unitary operator. This decomposition can 
be endowed with a physical meaning if one thinks of the 
measurement process as allowing for a sort of feedback 
to the quantum system. The raw measurement causes a 
"collapse" 



P 



CTfo 



1 

Pb 



-El'-pEl'- 



(15) 



in one's description of the system. But then, conditioned 
upon the outcome, one can think of the interaction as 
causing the system to further unitarily evolve to 



o-fe — > Pb = UbCJbUl 



(16) 



This split, of course, is a conceptual one: it may or may 
not correspond to the actual workings of the device carry- 
ing out the measurement £. Nevertheless, it can be quite 
useful for classifying different kinds of measurement in- 
teraction. 

Efficient measurements without feedback hold a spe- 
cial place in our considerations. These are measurement 
interactions for which all the Ub = /, so that Alice's state 
change is ultimately of the simple form 



P^-—^b P^b 

Pb 



(17) 



These hold a special place for us first and foremost be- 
cause they correspond to the "rawest" kind of measure- 
ment interaction allowed for a given POVM. Therefore, 
they are worthy of study in their own right ||3^. Sec- 
ondly, though, there are other problems for which they 
correspond to the least perturbing implementation of a 
POVM. Namely, if one contemplates performing the 
measurement on a system initially prepared in a com- 
pletely random pure state, then the mean input-output 



fidelity will be the greatest if the measurement has no 
feedback ^,Q. Finally, it stands to reason that if we 
can get a handle on the tradeoff between information 
and disturbance for such a special case, we will be better 
prepared for understanding the more general one of an 
arbitrary efficient measurement. We will also be better 
prepared to understand the precise role of feedback for 
controlling quantum systems ||43| ]. 

Our focus hereafter will be on efficient measurements 
without feedback. What is the strength of such a mea- 
surement? This issue is explored in Ref. B3], where a 
more refined notion of the concept is given a quantita- 
tive formulation. For the purposes here, we will only 
need the rawest of distinctions: finite vs. infinite mea- 
surement strength. An efficient measurement £ is said to 
be of finite strength as long as each nonvanishing Eh has 
support on the whole Hilbert space Tid — that is, as long 
as 



rankii^b = d , for all b such that E), ^ 



(18) 



A measurement is of infinite strength any time one of the 
nonvanishing E^s has rank strictly less than d. 

The utility of this notion comes about from noting that 
the set of all POVMs is a convex set. This follows from 
the fact that one can invent a notion of convex addition 
operation for POVMs: Simply take H 



pf + (1 - p)T = {pEb + (1 - p)Fb 



b=l 



(19) 



By a similar consideration, it is also true that the set 
of all POVMs with a fixed number n of nonvanishing 
elements E^ is a convex set. Thinking of this set as em- 
bedded in the space of length-n sequences of all Her- 
mitian operators, one has that the boundary of such a 
set is given by precisely what we are calling the infinite 
strength measurements (with n outcomes). The finite 
strength measurements lie strictly within the interior of 
the set. Making this identification in terminology is an 
attempt to capture the idea that an experimenter would 
need to expend an infinite amount of effort or money to 
work his way out to the boundary. For, if he could get all 
the way to the edge, there would be some preparations of 
the system for which he could predict with absolute cer- 
tainty that some outcomes of the measurement would not 
occur. That strikes us as beyond the power of mortals. 
As technology advances, we can imagine experimental- 
ists getting ever closer to the boundary, but never quite 
getting there. 

Let us now start applying these distinctions of mea- 
surement to the problem at hand — namely, to that of an 
Alice trying to surreptitiously increase her "knowledge" 
of a system while affecting Bob's "knowledge" of it as 
little as possible. How shall we quantify "knowledge" in 
this context? There are at least three canonical ways. 

The first has to do with the von Neumann entropy of 
a density operator p: 



S{p) = -tr p log p = - ^ Afc log Afc 



(20) 



fc=i 



where the A^ signify the eigenvalues of p. (We evaluate 
all logarithms in base 2 so that information is measured 
in bits, rather than nats or hartleys Q]. Also, we use 
the convention that A log A — whenever A = so that 
S{p) is always well defined.) 

The intuitive meaning of the von Neumann entropy 
can be found by first thinking about the Shannon en- 
tropy. Consider any von Neumann measurement V con- 
sisting of d one-dimensional orthogonal projectors 11^. 
The Shannon entropy for the outcomes of this measure- 
ment is given by 



^(^) = ~Y. (*'■/'".) log (trpn, 



(21) 



This number is bounded between and log d, and there 
are several reasons to think of it as a good measure of 
impredictability over the outcomes of a measurement V. 
Perhaps the most important of these is that it quantifies 
the number of yes-no questions one can expect to ask 
per measurement, if one's only means to ascertain the 
measurement outcome is from a colleague who knows the 
actual result ||2^ . Under this quantification, the lower the 
Shannon entropy, the more predictable a measurement's 
outcomes. 

A natural question to ask is: With respect to a given 
density operator p, which measurement V will give the 
most predictability over its outcomes? As it turns out, 
the answer is any V that forms a set of eigenprojectors 
for p Ig^l- When this obtains, the Shannon entropy of the 
measurement outcomes reduces to simply the von Neu- 
mann entropy of the density operator. The von Neumann 
entropy, then, signifies the amount of impredictability 
one achieves by way of a standard measurement in a best 
case scenario. Indeed, true to one's intuition, one has the 
most knowledge by this account when p is a pure state — 
for then S{p) = 0. Alternatively, one has the least knowl- 
edge when p is proportional to the identity operator — for 
then any measurement V will have outcomes that are all 
equally likely. 

The best case scenario for predictability, however, is a 
very limited case, and not so very informative about the 
density operator as a whole. Since the density operator 
contains, in principle, all that can be said about every 
possible measurement [E8|, it seems a shame to throw 
away the vast part of that information in our considera- 
tions. 

This issue leads to our next quantification of "knowl- 
edge" of a quantum system. For this, we again rely on the 
Shannon information as our basic notion of predictability. 
The difference is we evaluate it with respect to a "typical" 
measurement rather than the best possible one. However 
with this, a new question arises: Typical with respect to 
what? The notion of typical is only defined with respect 
to a given measure on the the set of measurements. 

Luckily, there is a fairly canonical answer. There is 
a unique measure df2n on the space of one-dimensional 
projectors that is invariant with respect to all unitary 



operations. That in turn naturally induces a canonical 
measure dQ-p on the space of von Neumann measure- 
ments V |p2 33|. Using this measure gives rise to the 
following quantity 



H{p) = f H{U)dfl-p 



d / (trpll) log (trpll) d^u 



(22) 
(23) 



which is intimately connected to the so-called quantum 
"subentropy" of Ref. [46[. Interestingly, this mean en- 
tropy can be evaluated explicitly in terms of the eigen- 
values of p and takes on the expression 



H{p) = 



1 



1 1 



ln2 V2 3 
where the subentropy Q{p) is defined by 



Q{p) 



k=l \i^k 



k ■ 



(24) 



(25) 



In the case where p has degenerate eigenvalues, A; = Am 
for I ^ m, one need only reset them to A; -I- e and A„i — e 
and consider the limit as e -^- 0. The limit is convergent 
and hence Q{p) is finite for all p. With this, one can also 
see that for a pure state p, Q{p) vanishes. Furthermore, 
since H{p) is bounded above by logd, we know that 



0<Q(p) <logd- 



-(- 
ln2 V2 



d - In 2 



(26) 



where 7 is Euler's constant. This means that for any p, 
Q{p) never exceeds 0.60995 bits. 

The interpretation of this result is the following. Even 
when one has the maximal knowledge about a system 
one can have under the laws of quantum mechanics — 
i.e., when one has a pure state — one can predict almost 
nothing about the outcome of a typical measurement WTl . 
In the limit of large d, the outcome entropy for a typical 
measurement is just a little over a half bit away from 
its maximal value. Having a mixed state for a system, 
reduces one's predictability even further, but indeed not 
by that much: The small deviation is captured by the 



function in Eq. (25), which becomes a quantification of 
"knowledge" in its own right. 

The two quantifications of knowledge about a quantum 
system given by Eqs. ( po| ) and (g5|) are without doubt 
two of the most well-motivated such quantities. However, 
because of their particular mathematical structures (in- 
volving logarithms and ratios of eigenvalues, etc.), they 
are often difficult to work with. It is therefore useful to 
consider quantities F{p) that may not have the strictest 
of interpretations in terms of "knowledge" or "informa- 
tion," but nevertheless carry some of the properties es- 
sential to the explorations we would like to make. The 
two properties that appear to be the most important for 



us is that a function F from density operators to real 
numbers be (1) unitarily invariant so that it only de- 
pends upon the eigenvalues of the density operator, and 
(2) concave in its argument. That is, one should have 

F{ppo + (1 - p)pi) > pFipo) + (1 - p)F(pi) , (27) 

for each pair of density operators po a-nd pi, and each 
real number p in the range [0, 1]. 

A common way of simplifying problems to do with the 
Shannon entropy is to consider instead a function that is 
merely quadratic in the probabilities ^,^ ■ In quantum 
mechanical terms, this translates to a function we shall 
call the impurity of a quantum state: 



P(p) = l-tr(p2) = l_^A: 



(28) 



fc=i 



This function, of course, has our two desired properties 
pQ|. Moreover, it attains its minimum value of when 
p is a pure state (just as S and Q do), and it attains its 
maximum value of ^^ when p is the completely mixed 
state. 

What makes unitarily invariant functions like the F{p) 
in Eq. (|27| ) special is that one can prove an interesting 
theorem for them in the measurement context. Consider 
any efficient measurement of a POVM £ = {Eb}. Upon 
finding an outcome b, the observer will update his quan- 
tum state for the system from the original p to some pb 
of the form in Eq. (Oh. What does this say about his 
expected change of knowledge? Well, one can prove that, 
whatever F is. 



F{p)>J2PbF{pb) 



In particular, it follows that 



S{p) > ^pbS{pb) , 

b 

Q{p)>J2PbQ(Pb)^ 
b 

Pip)>J2PbPiPb)- 



(29) 

(30) 

(31) 
(32) 



These statements — and in fact a stronger statement to 
do with a majorization relation between the eigenvalues 
of p and those of the pb — will be proven in Appendix A. 
(See also Nielsen Q for an earlier proof of this result.) 
The fact that Eq. (^^ holds for all concave functions 
F expresses what is meant by the phrase "the observer 
learns something from a quantum measurement" [ pO[ . 
Note in particular that it need not necessarily be the case 
that the purity, etc., be nondecreasing in any individual 
trial of a measurement. A simple counterexample suffices 
for illustration. Take, 



1/1 
3U 2 



(33) 



and consider a two-outcome efBcient measurement with- 
out feedback £ = {E, I — E) where 



E 



1/2 
3I0 1 



(34) 



Note that if outcome E occurs, the updated density op- 
erator for the system will be the completely mixed state 



1/1 



^^ " 2 V 1 



(35) 



which is certainly less pure than the initial state. Thus, 
one can only expect one's "knowledge" to increase on 
average during a measurement. 

Going back to our target scenario with Alice and Bob, 
one can see that this result insures that Alice comes away 
on average with more information than she started with. 
Moreover, this holds independently of the particular way 
in which we choose to quantify her "information." To 
make some notation, this means that the quantities 



l^l^^ F{p) -Y,PbF{pb 



(36) 



will all be nonnegative for any efficient measurement. 
The subscript on A'f^ denotes that this refers to the 
change of knowledge from the "inside" point of view of 
the measurer. 

An almost dual result is that from Bob's point of 
view — the outside point of view — whenever £ is not only 
an efficient measurement, but also a measurement with- 
out feedback^ his information can never increase from Al- 
ice's actions. That is to say, using notation from Eq. (|ll]), 
the quantity 



A^ 



F{p) ~ F{p) 



(37) 



is nonnegative for all concave unitarily invariant func- 
tions F ||5l[| . Again, the subscript in A^^^^ makes explicit 
that we are referring to a change of knowledge from the 
outside point of view. (The interested reader can find a 
proof that A''^^, > in Ref. Q.) 

We must emphasize that this result is almost dual to 
Eq. (P9[): for it certainly depends upon the assumption 
that the measurement is without feedback. Let us show 
this by way of a quick counterexample. Take £ to be 
a complete set of orthogonal projectors Eb — |6)(6|, b = 
1, . . . , d. One possible measurement with feedback that is 
consistent with this POVM is given by taking Ab — \'4}) {b\ 
for some fixed unit vector |'0). Clearly A\Ab = Eb as 
required. However, 



^a,p4 = |V')(VI 



(38) 



completely independently of what the initial state p is. 
So it can certainly be the case that F{p) < F{p) if one 
allows feedback into the picture. 



The conclusion to draw is that we are right on track 
in considering the quantities A^^^ and A^^^ in the con- 
text of measurements without feedback: in a sense, they 
are compensatory of each other. What we would like to 
do now is sharpen that idea. For, just because Alice's 
knowledge of the system can only increase through her 
measurements and Bob's can only decrease, it does not 
follow that there is necessarily a monotonic relation be- 
tween these adjustments. 

Here is how we will tackle the problem explicitly. As 
has been the case since the beginning, we imagine the 
initial state of knowledge for Alice and Bob fixed to be 
some density operator p. Now, however, we introduce 
a fiducial quantum measurement M = (A/f,) that will 
also be fixed throughout our considerations. The freedom 
we give Alice is that she may perform any measurement 
without feedback that is unitarily equivalent to A4 . That 
is to say, we shall consider measurement operators for 
Alice that are necessarily of the form 

Eb = C/Mbf/t , (39) 

where U is any unitary operation. Each different U de- 
fines a consequent change in both Alice and Bob's total 
information which we denote by Af^([/) and A^^^^(f/), re- 
spectively. (This notation makes no reference to p and 
M because they are fixed background information for 
the problem.) What we would like to know is: Under 
what conditions is there a nontrivial monotone relation 
between A^^(C/) and Nl^^{U) as we vary U7 In the cases 
where such a monotone relation exists, that will be the 
tradeoff we have been seeking. 

This completes the formulation of our problem. Un- 
fortunately, as opposed to the formulation, we have not 
settled the issue of a tradeoff relation in complete gen- 
erality. Study of the two-dimensional Hilbert-space case, 
however, already turns out to be of significant interest. 
In the next Section, we report a careful study of the case 
where d = 2 and A4 contains two outcome operators Mq 
and Ml. Even in this restricted class, there is a large 
regime of measurements with a nontrivial information 
tradeoff relation. 



III. THE 2-D TWO-OUTCOME PROBLEM 

In this section, we assume explicitly that d = 2, so 
that Alice and Bob's information is about a single qubit. 
The canonical measurement Ai that sets Alice's stan- 
dard is taken to consist of only two elements Mq and 
Ml, but is otherwise completely general. Alice now has 
the freedom to choose any unitary operation [/, and con- 
sequently perform any measurement £ consisting of ele- 
ments Eq = UMqW and Ei = UMiW. The question we 
should like to address is how A^^([/) and N'^^^iU) change 
with respect to each other as a function of U . 

Note that because there are only two outcomes to the 
measurement, Eq and Ei must commute. There is there- 
fore only one diagonalizing basis required in specifying 



this measurement. Let us relabel the measurement to 
make that more explicit: We shall simply denote the two 
outcomes by E and I — E. With our previous definitions, 
this measurement is of finite strength when neither E nor 
I — E is a rank-1 operator. 

We have performed extensive numerical work that 
shows the following when p is impure and £ is of finite 
strength. For the three concave functions S{p), Q{p), and 
P{p) considered in the previous section, there are signif- 
icant regions in POVM space where A'[^(C/) achieves its 
maximum value precisely when A^^j([/) is nonminimal. 
That is to say, Alice cannot learn the most unless she 
also disturbs Bob's information in the process. In this 
situation, the optimal measurement operator E does not 
commute with p. Alternatively, when E commutes with 
p, the difference A'^^j([/) achieves its minimum value, 
namely — so that Bob's information is not disturbed at 
all — but then A^^(L'^) achieves its minimum value too — so 
that Alice has learned the least amount possible. In gen- 
eral, the functional relationship A^^j(A^jJ is a monotonic 
one as A^^ ranges from its minimum to its maximum 
value. In those regions of POVM space where there is 
no nontrivial tradeoff relation, the curve for A^^^j^A^j^) is 
simply ffat. 

What we shall do herein is focus on quantifying the 
tradeoff explicitly for the case in which "knowledge" is 
identified with the impurity function P{p) of Eq. (pq). 
In this case, all calculations can be done analytically and 
one can get a feel for the exact form of things. (In the 
other cases oi F = S or F = Q, things are not terribly 
worse, but because the binary Shannon entropy function 
cannot be inverted analytically, there is no way to get 
an analytic expression for the function A^^j(A'[^).) With 
this restriction, we will hereafter drop the superscript F 
from our notation and write simply A^^t and Ai„ for the 
"information" changes we are considering. 

Let us start the calculations straight away. From 
the inside point of view of Alice, the two possible state 
changes are of the form 



1 



and 



P 



P 



P^E = 



trpE 



EpVE , 



1 



VT^^pvT^^ 



(40) 



(41) 



1 - tr pE 
^From the outside point of view of Bob, it is simply 

p — > p = ^p^/E+^/I -Ep^/I -E . (42) 



Keeping in mind that vE and ^/I — E commute, a little 
algebra yields that 



A„„t ::= tr p^ - tr p^ 



tr p^E -tr pEpE 



-tr{p^E{I-E)p^E{I-E) 



(43) 



(44) 



Similarly, 



(45) 



iipE{l-ivpE) 



iipE{l-iipE) 



tr pEpE + tr p^ tr p£; 



2trp^trp2£; 



trp2 (46) 



irpEpE-2irpEiYp^E 



+trp^ (trp£;)^ 



(47) 



Note immediately that if E and p comm.utc, then A„ut 
vanishes as one would expect. 

Since we are dealing with a two-dimensional Hilbert 
space, it is most convenient at this point to switch to a 
kind of Bloch-sphere notation for all operators. Then we 
may write. 



P= ^^I + o--(r) , 



where a — [a^, ay,az) is some vector of real numbers with 
modulus a < 1 and a is the vector of Pauli operators. 
Similarly, if a = tvE, then the operator 



1 
B ^ -E 
a 

is a density operator, and we may write 



B = l{l + b-a), 



(49) 



(50) 



where b also has a length b no greater than unity. In 
this notation, E and p commute if and only if b and a lie 
within the same ray. 

Since < £^ < /, we must have < a < 2. More- 
over, we must insure that the larger eigenvalue of E is no 
greater than unity. Using the fact that the eigenvalues 
of E in Bloch-sphere notation are given by ia(l ± 6), it 
follows that we must require 



a < 



1 



(51) 



One can see £ becomes an infinite strength measurement 
whenever 6=1 and a is any value, or whenever 6 < 1 but 
a — 2/(1-1-6). The parameter a to some extent captures 
the amount of symmetry between the two measurement 
operators E and I — E. It is therefore natural to call the 
case where a — I the symmetric case. 

With the notations of Eqs. ( p9| ) and ( |50| ) it becomes 
a tractable task to calculate the various operators in 
Eqs. (ph and (H^). Using the law of multiplication for 
Pauli matrices, i.e. 



-J, i.V^., 



(m ■ (j)(n ■ a) = (to • n)I + ia ■ (m x ft) , (52) 

one finds fairly easily that: 



trp2^-(l + a2) 
tr pE = -(1 + abz) 
tvp^E^ j(l + a^ + 2abz) 



(53) 
(54) 
(55) 



tvpEpE == — (l + a^ + 6^ + a'^b^{2z^ - 1) + 4a6z) (56) 
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where z — cos 9, and 9 is the angle between the vectors 
a and 6. 

The only really daunting term that we must calculate 
is the quantity 



tr (^p^E{I~E)p^E{I-E) 
To make some headway, let 



G = E{I-E)=goI + g-S 



(48) where 



go = t"(2- a- ctb^) 
9 = -a{l-a)b . 

We need to find an ro and r such that 
vG = ro/ + r ■ <T . 



(57) 

(58) 

(59) 
(60) 

(61) 



The method for this is simple: We just need calculate 
G — \fG\fG and set the resultant equal to Eq. ([Sq). 
Carrying this procedure to its conclusion, we arrive at 
the following identifications: 



2-a 



- ab^ + y^(l-62)(4-4a+(l-62)a2) 

(62) 



r = - — a(l — a)6 . 
4ro 



With this, we can finally calculate 
Ir 



(63) 



tr y/Gp\/Gp = 



l + a^ +&■ + 2{d-c)'^ ~ a-^d^ -^ 4a • c 



(64) 



where the vector c and its magnitude c are defined by 

c = — . (65) 

Putting all these ingredients together, we finally arrive 
at our sought-after expressions: 



A.„ = 



ab'^{l-a^){l~a^z^) 
2{1 + abz){2 - a - aabz) ' 



(66) 



and 



1 f aah\ 



{l~af+Arl (l-z^) 



(67) 



These two equations contain everything needed for a 
complete analysis of the information tradeoff question. 
Let us first see how this plays out for the simple case 
described in Eqs. (0) and (0) of the Introduction. 



A. The Symmetric Case 

In this case the measurement operators Afg and Mi 
take the form 



Afo = kHo + (1 - k)Wi 
Ml = (1 - K)no + kHi 



(68) 
(69) 



where < k < 1, and XIq and IIi are the projectors onto 
some orthonormal basis. Measurement operators of this 
form come up quite naturally in the theory of continuous 
quantum measurements BS]. In our Bloch sphere nota- 
tion of Eqs. ( |49| ) and (|50|), this case corresponds to taking 
a = \ and 6 = 2k — 1. 

Plugging a = 1 into Eqs. ( pq ) and (|67|), we find the 
significantly simpler expressions 



A,„ 



-h' I 



1 



a^z'^ 



1 — aP'h'^z^ 



and 



A„„, = ^&V(l-z2) 



(70) 



(71) 



Clearly, A^^t is minimized when z == 1 or — 1 (so that 
E commutes with p) as we have noted before. Moreover 
Ao„t is maximized when z = — that is to say, when 
the operator E is diagonal in a basis complementary or 
mutually unbiased to the diagonal of p. On the other 
hand, since b < 1, Aj,, is a strictly decreasing function in 
z"^ . This gives Ai„ the same qualitative behavior as A„ut 
and ultimately leads precisely to our tradeoff relation: 
Eliminating z"^ from Eqs. ([70[) and (|7l|), we obtain 



A„ 



2(l-a262)A„-&2(i_a2)2 



2 1 



2A„ 



(72) 



This example is something of an extreme for the phe- 
nomena we have been hoping for. As long as the fidu- 
cial measurement M is of finite strength (i.e., b ^ 1), 
Eq. (|72| ) traces out a nontrivial monotone curve as we 
go from A"j" to A™. But more than this, Ai„ is maxi- 
mized at precisely the same value of z for which A^^t is 
also maximized. In common language, this means that 
if Alice wishes to gather the most information, she must 
reciprocally cause Bob to loose the most information that 
her class of measurements will allow. The only means for 



Alice to lessen the sting of this effect is to develop her 
technology so that the limit 6^1 can be approached 
asymptotically. 

This behavior, at first sight, appears to be quite deep. 
It helps lend credence to the idea that measurements 
without feedback are always somewhat destructive by 
their nature — that is, as long as one's aim is to increase 
one's information as much as possible under the con- 
straint of having less than "infinitely powerful" measure- 
ment devices. 

Interestingly, however, this type of behavior is not com- 
pletely generic. There are some fiducial measurements M 
of finite strength for which the tradeoff effect disappears. 
To see this, we must turn back to our base equations 
Eqs. (H) and (IgT 



B. The General Case 

Let us now assume strictly that none of the variables a, 
h, or a happen to equal unity. Then, as in the symmetric 
case, the quantity A^^t is clearly minimized when z'^ — 1. 
Similarly, the disturbance to Bob's knowledge is largest 
when z = 0, so that E is diagonal in a basis mutually 
unbiased with respect to the diagonal of p. 

The analysis of the general Ai„ is significantly more 
difhcult. One can show that the quantity is minimized 
at z^ = 1, but whether that occurs atz = lorz = — 1 
now depends upon the size of a. The way to see this is 
by checking that Ai„ is concave as a function of z: The 
calculation is tedious, but it can be done analytically. 
The point where the curve changes from a positive slope 
to a negative slope, i.e., where the function attains its 
maximum, is given by 



1 



zo 



a{l — a)ab 



Ari - a(2 - a - ab 



(73) 



This expression is quite revealing. For a fixed value 
of 6 7^ 0, one can check for those values of a that force 
zq = 1 or zo = — 1. These are 



Uo = l 



and 



5(1 + a2) + 2a 
6(l + a2)+a(l + 62) 

b{l + a'^)~2a 
5(1 + a2) -a(l + 62) 

This means that for a in the ranges 



a 



< a < max< 0, 



b{l + a^)-2a 
6(l-ha2)-a(l + 52) 



and 



5(l + a2)-2a 2 

^ ' < a < 



6(l + a2)-a(l + 62) 



1 + 5 



(74) 



(75) 



(76) 



(77) 
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FIG. 2. The tradeofl between information, Ai„, and distur- 
bance, Aout, is plotted here for a = 1, 6 = 0.9 and three values 
of a. Solid line: a = 0.8, dotted line: a = 0.79 and dot-dash 
line: a = 0.78 . 

Ai„ will always be maximized by choosing z = 1 or 
z = — 1. However, for a outside of either of those ranges, 
there will always be a nontrivial tradeoff relation: When 
Alice's information gain Ai„ is maximized, Bob's infor- 
mation loss A„„t will be strictly greater than its minimal 
value. 

The general tradeoff relation, when it exists, is found 
simply enough by eliminating the variable z from the 
simultaneous equations (pq) and (p^) . (Two examples of 
the tradeoff relation are given in figures H a nd 0.) This 
time — in contrast to what we did in Eq. ( [72| ) however — 
we leave finding the explicit expression as an exercise for 
the reader: Seeing it explicitly adds little to the analysis 
already given. 



FIG. 3. The tradeoff between information, Ai„, and dis- 
turbance, Aout, is plotted here for a = 1, 6 = 0.1 and three 
values of a. Solid line: a — 0.8, dotted line: a = 0.79 and 
dot-dash line: a — 0.78 . 

threshold. That is to say, in carrying out the program 
of this paper, we would imagine not only varying over 
unitary orbits for defining a tradeoff relation, but rather 
over any region of POVM space so long as a certain con- 
straint on the measurement strength is obeyed. Unfortu- 
nately, if this is going to be the case, it is going to require 
some thinking more subtle than we have carried out so 
far. This is because for at least one natural definition of 
measurement strength we again find no nontrivial trade- 
off relation. The failure of this program is described in 
Appendix B. So the question remains. 

In general, this paper forms part of a larger effort to 
fully delimit the information-disturbance tradeoff prop- 
erties of quantum mechanics. 



IV. DISCUSSION 

Our conclusion is straightforward: There are regions in 
the space of finite-strength efficient measurements with- 
out feedback for which a nontrivial information tradeoff 
relation exists as one unitarily varies around any given 
fiducial measurement M. In a way, it is a shame that 
we could not make a more unqualified assertion — for in- 
stance, that a nontrivial tradeoff relation held for all fi- 
nite strength quantum measurements without feedback. 
Indeed the hope that such would be the case was a large 
part of the motivation for this work. 

The question now arises as to the significance of the 
rather complicated regions defined by Eqs. (JT^) and (77). 
What trenchant physical property is implied of a mea- 
surement A4 that sits in the information-disturbance re- 
gion of a given density operator p? 

A toy idea is that the key distinction lies not in finite 
vs. infinite measurement strength, but in whether the 
measurement sits above or below a certain finite-strength 



APPENDIX A: EFFICIENT MEASUREMENTS 
INCREASE ALICE'S INFORMATION 

In this appendix, we prove that for any efficient quan- 
tum measurement, an observer aware of the outcomes 
will on average increase his "knowledge" of the measured 
quantum system. More precisely, when a measurement 
causes the observer to update his density operator from p 
to pb — as in Eq. (O) — it holds for any concave unitarily 
invariant function F that 



F{p)>J2PbF{pb) 



(78) 



Along the way, and as something of an aside, we will also 
prove a stronger result that deals directly with relations 
between the eigenvalues of p and all the pt- This result 
is most conveniently couched in terms of the mathemat- 
ical theory of majorization [ p2[ , and will require a little 
notation for its statement. 
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Let us define X{0) to be the vector of eigenvalues of 
a Hermitian operator O on Tid-, with the components 
arranged in terms of decreasing magnitude. That is to 
say, let the numbers Xi{0) obey the ordering 



Xi{0) > X2{0) > ■ ■ ■ > Xd{0) 



(79) 



We say that a vector X{0) is majorized by a vector X{J\f), 
and write 



A(0) -< X{Af) 



when 



J2xdO)<J2H^) 

i=l i=l 

for all fc = 1,2, ... ,d, and 

d d 

^A.(0) = ^A.(AA). 

1=1 1=1 



(80) 



(81) 



(82) 



One can also say that a Hermitian operator O is ma- 
jorized by a Hermitian operator M , and write O -< M, 
when X{0) -< X{M), but we will not have any need for 
that terminology in this development. 
Our main result is this: 



Hp) ^ ^PbX{pb) 



(83) 



(As pointed out before, this result has also been obtained 
recently in Ref. p^.) The proof of this statement is not 
too difficult if we rely on some results from the math- 
ematical literature [M, and a method of thought pro- 
moted by Ben Schumacher on several occasions to great 
result — see Refs. M,pJ], to name only a few. 

The trick of Schumacher is this. Whenever we have 
a quantum system Q and we say that it is in a (mixed) 
state p, there is nothing to prevent us from thinking that 
the situation has come about because Q is part of a larger 
system RQ, which we happen to describe via some pure 
state \ip^^). The state p then is just a partial trace over 
the larger pure state: 



p = trH|V'"«)(V"'=| 



(84) 



There are times when such a conception can be quite 
useful for simplifying the mathematics of a problem. The 
problem at hand is one of them. 

Let us now describe the measurement process on a sys- 
tem Q from such a point of view. It will be useful to make 
explicit precisely which system we are referring to at any 
given time: therefore we shall add superscripts or sub- 
scripts, R, Q, or RQ to all density operators to make 
that clear. In these new terms, the state change under 
measurement that we are interested in is given by 



,Q - Itl T^^/^p'iE'J''ul 



Pb 



,1/2, 



(85) 



after an outcome h is found. That same measurement, 
on the other hand, changes the state of the RQ system 
according to: 



\r 



iv^D 



-(/,®A)|V'"''), (86) 

Pb 



where 



Ab = UbE, 



1/2 



(87) 



(Recall that pure states remain pure under an efficient 
measurement.) The operator /r in this equation, of 
course, signifies the identity operator on the R system. 

Note that the initial density operators p^ and p'-^ for 
the R and Q systems are unitarily equivalent. I.e., 

p^=trQ|V"«)(^"«|=Fp«V^^ (88) 

for some unitary operator V. In particular, it follows 
that p^ and p'^ have the same eigenvalues. We can also 
note, however, that since a measurement on Q can have 
no overall effect on the R, it must be the case that 



.■^^trjj^p.iv^nw' 



(89) 



One can see this more formally by choosing a Schmidt 
decomposition for IV'"''), 



(90) 



fc=i 



(91) 
(92) 



Then 

= Y,{qi\{h,® Ab)\r''){r%I..® Al)\qi 
lb 

= Y^ \/x'k\/X^\rk){rrn\{qi\Ab\qk){qm\Al\qi) (93) 

kmlh 

= Y V>^k\/KJ,rk){r„,\{qrn\Al\qi){qi\Ab\qk) (94) 

kmlh 

kmb 

= / ^ V ^kV Kn\'^k)\1^m\0mk 
km 

^YXk\rk){rk\ 



(95) 

(96) 

(97) 
(98) 



It follows from Eq. (|89|) and the statement preceding 
it that 



A(p«) = A(p-) = A tr J 5]p.|^r)(V'n 



(99) 
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But trq is a linear mapping. So defining 
we have 



kp"") ^ ^ [J2p''Pb 



(100) 



(101) 



Now comes the point where we rely on the mathemat- 
ical literature ever so slightly by using Ky Fan's domi- 
nance theorem |32| . One can show that for any Hermitian 
operator O, 



E 



Xi(O) = max trPO 
p 



(102) 



where the maximization is taken over all rank-/c projec- 
tors. It follows from this almost immediately that 



X{0 + JV) ^ X{0) + X{JV) 



(103) 



since 



max trP(0 + TV) < max trPO + max trPTV . (104) 



It follows from this that 

x{p^)^Y.^{p,pi)^j2p>'^(Pb) 



(105) 



since X{cO) — cX{0) for any positive number c. 

Noting finally that the eigenvalue spectrum of p^ is the 
same as that of p'j^ , we have ultimately 



Kp'') ^T^pb^^Pb) 



(106) 



Stripping off the superscript Q, we have the desired result 
Eq. (p3|), and the theorem is proved. 

It comes about as a corollary to Eq. (p3[), through 
some theorems in Ref. M] that our most desired result — 
namely Eq. ([Tg) — holds for any concave unitarily invari- 
ant function F. However, there is a more direct way to 
see this, and it seems wor thwh ile to take that route. We 
need only back up to Eq. (101). From this it follows that 



F{p'=^) ^ F (y2pbp?j 

But, F is concave and so 

Fip^') > Y.P,F[pl) 

b 

= Y.P^Fip'^)- 



(107) 

(108) 
(109) 



Again, stripping off the superscript Q, we obtain the de- 
sired result. 



Note Added in Proof: Instructive though it is to derive 
Eq. ( p3|) by first extending the problem to an ancillary 
Hilbert space, there is an even shorter route to the result 
that is worth recording. The trick to note is this: With 
each efficient measurement £ = (Eb) = (AlAb), we can 
associate a canonical decomposition of the density oper- 
ator starting from the fact that the Ei, form a resolution 
of the identity. Starting from the equation 



I-E^b 



(110) 



one simply multiplies it from the left and right by p^'^ 
to get 



P = ^PbUJb 
b 



where 



uJb = —p 
Pb 



'/^Ebp'/^ 



(111) 



(112) 



and Pb — tipEb as always. 

Using the Ky Fan dominance theorem just as before, 
but now on Eq. (|lllD, we have straight away that 



(113) 



Hp) -< ^PbM^b) 



However, it is an easy matter to see that the operators 
p^^^Ebp^^^ and AbpAl have precisely the same eigenvalue 
structure. Just start off with the eigenvalue equation 

(114) 



{p'/'AlAbp'/')\^)=p,\^) 



Multiplying this from the left by Abp^^^ and regrouping 
terms, one gets 



AbpAl{Abp'/^\t)) ^ tH{Abp'^^\i)) 



(115) 



which means that p^/^Ebp^^"^ and AbpA\ have the same 
eigenvalues. Using this, Eq. (gSj) follows immediately. 



APPENDIX B: FULL VARIATION OVER 
MEASUREMENTS OF A GIVEN STRENGTH 

For this Appendix, we drop the distinction of finite vs. 
infinite measurement strength and attempt to grade all 
measurements via a single finite number. One possible 
notion of such a measurement strength is the amount by 
which Alice's purity would change if p happened to be 
the maximally mixed state i J — that is, the measurement 
strength would be her change of knowledge if she starts 
out completely ignorant of the system. We can do this 
with respect to any of the functions F in Eq. (|2^) , but for 
convenience we will again adopt the impurity P to be the 
main function of interest. Also for convenience, we will 
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actually adopt two times the said quantity above, i.e., 
2Ai„(i/). This choice of pre- factor causes our notion of 
measurement strength to range in the full interval [0, 1]. 
Thus, using Eq. ( |66|) and taking a = 0, a given mea- 
surement strength k for a two-outcome measurement 
{E, I-E) is defined by 



k = 



2-a 



(116) 



The question we shall pose in this Appendix is the follow- 
ing. For a given quantum state p and a fixed measure- 
ment strength fc, what is the maximum value of Ai„ and 
what values of z achieve that maximum? In particular, 
can we show that the optimal values for z^ in this prob- 
lem are strictly less than 1? Unfortunately, we will have 
to answer the latter question in the negative, regardless 
of the value a defining the purity of the initial density 
operator. 

This is seen as follows. Fix k anywhere in the range 
between and 1. For a fixed b this means that a must 
take on the value 



2k 



b^ + k ' 



(117) 



Note that for a fixed value of k we are not allowed to 
freely choose b as we wish. This is because for a fixed fc, 
the variable b cannot be too small or we would never be 
able to satisfy Eq. (116). The valid range for b turns out 
to be 



k<b<l 



(118) 



The consideration leading to this is simple. The function 
2^— is monotonically increasing in a. So to find our 
smallest value of 5, we should place the largest al lowe d 
value of a, Eq. (pl|), i nto t he right-hand side of Eq. ( 117 ). 
Doing this gives Eq. (IIS). 

Plugging Eq. (117) into Eq. ( |66| ) gives a surprisingly 
simple expression 



fc(l-Q^)(l-fl^Z^)b 

2(1 + azb){b - akz) 



(119) 



Let us now examine the behavior of this as a function 
of b. Taking the partial derivative with respect to b, we 
obtain 



2 9Ai„ 

fc(l-a2)(l-a2z2) "db' 



a{b'^ -F k)z 



{l + azbfib-akzf ■ 
(120) 



Therefore Ai„(6) toggles from being an increasing to a 
decreasing function depending upon the sign of z. Thus 
maxb Ai„(6) takes on a piecewise form. If z > 0, we 
should choose 6 = fc; if z < 0, we should choose 6=1. 
The resultant of these choices is conveniently summarized 
as follows: 



2 1-1- ak\z\ 



(121) 



The function A"^'"'(z) in Eq. (121) is increasing in \z\ 
since fc < 1. Hence it finally follows that the very best 
strategy on Alice's part for a given measurement strength 
k is to take z = 1 or — 1. Doing so gives her an absolute 
maximum purity change of 



at: - 2^^ ^^^T 



ak 



(122) 



and that purity change is accompanied by a purity chan 
of A„„t = for Bob. 



Lge 
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